Towards very large vocabulary word recognition

نویسنده

A. Waibel

چکیده

i In mis paper, preliminary considerations and some experimental results are presented in an effort to design Very Large Vocabulary Recognition (VLVR) systems. We will first consider the applicability of current recognition techniques and argue their inadequacy for VLVR. Possible alternate strategies will be explored and their potential usefulness statistically evaluated. Our results indicate that suprasegmental cues such as syllabification, stress patterns, rhythmic patterns and the voiced unvoiced patterns in the syllables of a word provide powerful mechanisms for search space reduction. Suprasegmental features could thus operate in a complementary fashion to segmental features. V

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An A* algorithm for very large vocabulary continuous speech recognition

We present a new search algorithm for very large vocabulary continuous speech recognition. Continuous speech recognition with this algorithm is only about 10 times more computationally expensive than isolated word recognition. We report preliminary recognition results obtained by testing our recognizer on "books on tape" using a 60,000 word dictionary.

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition

Connectionist Rpeech recognition systems are often handicapped by an inconsistency between training and testing criteria. This problem is addressed by the Multi-State Time Delay Neural Network (MS-TDNN), a hierarchical phonf'mp and word classifier which uses DTW to modulate its connectivit.y pattern, and which is directly trained on word-level targets. The consistent use of word accuracy as a c...

متن کامل

Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

Word-based consensus networks have been verified to be very useful in minimizing word error rates (WER) for large vocabulary continuous speech recognition for western languages. By considering the special structure of Chinese language, this paper points out that character-based rather then wordbased consensus networks should work better for Chinese language. This was verified by extensive exper...

متن کامل

Turkish LVCSR: towards better speech recognition for agglutinative languages

The Turkish language belongs to the Turkic family. All members of this family are close to one another in terms of linguistic structure. Typological similarities are vowel harmony, verb-final word order and agglutinative morphology. This latter property causes a very fast vocabulary growth resulting in a large number of out-of-vocabulary words. In this paper we describe our first experiments in...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Towards very large vocabulary word recognition

نویسنده

چکیده

منابع مشابه

An A* algorithm for very large vocabulary continuous speech recognition

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition

Improved Large Vocabulary Continuous Chinese Speech Recognition by Character-Based Consensus Networks

Turkish LVCSR: towards better speech recognition for agglutinative languages

عنوان ژورنال:

اشتراک گذاری